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Abstract 

Let S n be the set of all permutations on [n] := {1,2,..., n}. We 
denote by n n the smallest cardinality of a subset A of 5 n +i that "cov- 
ers" S n , in the sense that each n S S n may be found as an order- 
isomorphic subsequence of some ir' in A. What are general upper 
bounds on k„? If we randomly select u n elements of S n +i, when does 
the probability that they cover S n transition from to 1? Can we 
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provide a fine-magnification analysis that provides the "probability of 
coverage" when v n is around the level given by the phase transition? 
In this paper we answer these questions and raise others. 

1 Introduction 

The problem discussed in this paper was posed by Prof. Robert Brignall 
during the Open Problem Session at the International Permutation Patterns 
Conference held at California State Polytechnic University in June 2011. The 
conference webpage may be found at 



http : //www . calpoly . edu/~math/PP2011/ 



Brignall asked, "If S n denotes the set of all permutations on [n] := {1, 2, . . . , n}, 
what can we say about K n , the smallest cardinality of a subset A of S n+ i 
that "covers" S n , in the sense that each it G S n may be found as an order- 
isomorphic subsequence of some n' in A" Specifically, he asked about bounds, 
exact values, and asymptotics. Small values are easy to calculate; e.g., it is 
easy to see that K\ — 1, Ki — 1, and the permutation set {1342, 4213} reveals 
that «3 = 2 - but the situation rapidly gets out of precise control. 

In Section 2, we provide a trivial pigeonhole lower bound on n n and then 
use the "method of alterations" [1] to derive an upper general bound on K n 
that contains a logarithmic factor that often shows up when using such argu- 
ments in covering-type situations; see, e.g. the general upper bound on the 
size of covering designs that was proved by Erdos and Spencer in their early 
work [3]. We continue by showing that the second and subsequent coverings 
of the n-permutations are accomplished in linear log log time, in a result that 
is reminiscent of the ones in [8] (covering designs) and [7] (^-covering arrays). 
Lastly, in Section 2, we make comparisons to the development in Spencer 
[T2] to produce evidence that the asymptotic value of K n is (upto a 0(1) 
or perhaps 1 + o(l) factor) the same as that given by the lower bound; we 
conjecture that this is true. 

In Section 3, we switch to a different approach, asking the question "If 
we randomly select v n elements of S n+ i, when does the probability that they 
cover S n transition from asymptotically zero to asymptotically 1?" Our 
main result in this area, Theorem 6, is proved using the Janson exponential 
inequality ([I]). The threshold in Theorem 6 contains a small gap, but the 
result is the best possible, as we show in Section 4, where the Stein-Chen 



method of Poisson approximation is used to prove, that in the regime of 
u n s that are "in the gap," the number of uncovered n-permutations X has a 
Poisson distribution with finite mean, and thus F(X = 0) is a finite constant 
that is bounded away from zero and one. 

2 Bounds 

Our first preliminary result provides a formula for the number c(n, it) of 
permutations in S n+ i that cover a fixed it G S n . 

Lemma 1. Let c(n, it) denote the number of permutations in S n+ i that cover 
a fixed it G S n . Then c(n, it) = c(n, it') = n 2 + 1 for each it, it' G S n . 

Proof. It is clear that any permutation pattern it G S n may be realized in 
\ ) = n + 1 ways, one for any choice of n numbers from {1, 2, . . . , n + 1}. 
Arrange these ways lexicographically (for example if n = 3, we can realize 
the pattern 132 as 132, 142, 243, and 143, or, lexicographically, as 132, 142, 
143, 243). Note that the rth and (r + l)st lex-orderings of it differ in a single 
bit. Now, given any realization of it, the (n + l)st letter may clearly be 
inserted in (n + 1) ways to create an (n + l)-covering permutation; however, 
for any 1 < r < n — 1, the list of covering (n + l)-permutations for the rth 
and (r + l)st lex-orderings have an overlap of magnitude 2, corresponding to 
whether the (n + l)st letter is inserted before or after the non-matching bit. 
Thus c(n, it) = c(n, it') = {n + l) 2 — In = n 2 + 1, as asserted. □ 

Lemma 1 can now be used to prove 

Theorem 2. 

(n + 1)! , logrt 



(1 + o(l)) < K n < ^(n + 1)!(1 + 0(1)). 
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Proof. The lower bound is elementary. Each of the n n permutations in a 
covering, "takes care," with repetition, of ( n ^ ) = n + 1 n-permutations. 
Since we have a covering, clearly, K n {n + 1) > n\, or 



n! (n + 1)! / 2n 

K n > — = 1 — 



(n + 1) n 2 \ (n + 1) 



For the upper bound, we use the method of alterations pQ as follows: Choose 
a random number Y of (n + 1) -permutations by "without replacement" sam- 
pling. The expected number of uncovered n-permutations, by Lemma 1 and 
linearity of expectation, is 

/(n+l)\-n 2 -l\ 
E W = «'■ („■+!)■) - 

We choose a realization with X = Xy < E(X) and cover these with at most 

/ n 2 + 1 \ y 
n!(l-y ^yj <n!exp{-y(n 2 + l)/(n+l)!} 

additional (n + l)-permutations, yielding, for any initial size Y, a covering 
with at most 

Y + n\ exp{-F(n 2 + l)/(n + 1)!} 

members. Minimizing over Y yields an initial choice of size 

(ra + 1)!, /n 2 + l 
log 



{n 2 + 1) ta V n + 1 
and an upper bound of 



Q + l)! / / n 2 + l \\ log 7i i 

(n 2 + 1) V g U + 1 / / ~ n 2 



^SS i + "* ^ ^<» + W + <KD). 



as claimed. D 

We next ask how many more (n + l)-permutations are required to cover 
the n permutations multiple times. Here the situation is often nuanced, 
and leads to the question as to whether the logarithmic factor, present in 
the upper bound for the first covering, is extraneous. Typically, in other 
covering contexts, we find that the second and subsequent coverings need an 
appropriately normalized log log n additional elements in the cover; see, e.g. 
[8] for a covering design analogy, [7] for an occurrence of this phenomenon in 
t-covering arrays, and jl], [9] for the log log behavior in the coupon collection 
problem. We briefly describe the parallel in the context of covering designs: 
A collection A of sets of size k of [n] is said to form a t-covering design if each 
t-set is contained in at least one fc-set in A. If m(n, k, t) denotes the smallest 



size of a t-covering design A then it is clear that m(n, k,t) > (™)/( t ); Erdos 
and Spencer proved in [3] that \/n,k,t, 



( n ) ( 
m(n,k,t)<p-( 1 + log 



it was shown furthermore in [8] that the minimum number m(n,k,t,X) of 
fc-sets needed to cover each t-set A times satisfied 
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m(n, k, t, A) < p- I 1 + log ( 1 + (A - 1) log log II + 0(1] 

n,k,t — >■ oo. This was the log log result. Also, the Erdos-Hanani conjecture, 
namely that for fixed fc, t, 

m(n,k,t) 1 

was proved by Rodl [11] and, later, by Spencer [12]. This showed that the 
logarithmic factor in the Erdos-Spencer bound could be asymptotically dis- 
pensed with. Finally, see [3] for a corresponding threshold result. It is these 
questions we seek to address, in our context, in the rest of this section and 
the next. 

Theorem 3. Let k Uj \ denote the minimum number of (n + 1) -permutations 
needed to cover each n-permutation A > 2 times. Then, 

(n + IV 
Kn,* < ^—^ (logn+ (A - 1) log log n + 0(1)) . 

Proof. We first choose an unspecified number Y of (n + l)-permutations ran- 
domly and with replacement. This might lead to replication with very small 
probability, but the proof is far more streamlined - and can easily be adapted 
to the case where we choose Y distinct permutations. The probability that 
any permutation is covered just j times; < j < A — 1 is 

Y\ ( n 2 + 1 A j / n 2 + 1 x y ~ j 
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j J \(n + 1)!/ V (n+1 
so that the expected number of such permutations is 

Y\ ( n 2 + l \ j / n 2 + l ^ Y ~ j 



E(X i )=nl[ t —7 1-7 —7 ;0<j<A-l. 



We can cover each such permutation in any ad hoc way by choosing X — j ad- 
ditional (n + l)-permutations, and so the number Zy of (n + l)-permutations 
in this successful A-covering is 



A-l 

\ _ n\ Y 

J' 



j=0 



and thus 

«.=-&-„ C )(^)'(.-^)" 

for any initial choice of Y permutations. Set 

n 2 + 1 
(n + 1)! 

and, given Y Bernoulli trials with success probability p, denote the cumula- 
tive and point binomial probabilities by B(Y,p, k) and b(Y,p, k) respectively, 
< k < Y. It is easy to verify that for Yp > X 

E(Z Y ) = Y + n\XB(Y,p,X-l)-n\YpB(Y -l,p,X-2) 

= Y + n\Xb(Y, Pl X - 1) + n\ {XB(Y lP , X - 2) - YpB(Y -l,p,X- 2)} 

< Y + n\Xb(Y,p, X - 1) + n\X{B(Y,p, X - 2) - S(y - l,p, A - 2)} 

= y + nUft^PjA-lJ+nU^^TjA^" 1 ^ ( tt^- - 1 ) 

< y + n!A6(y,p,A-l) 

We do not attempt to optimize carefully in (1); rather we set 

(n + 1)! 
y = — 5 — — (logn + (A - 1) log log n) 
n z + 1 



and see that the second term T 2 in (1) reduces as 



To = n\X 



A-l 



logn + (A — 1) loglognA exp{— logn — (A — 1) loglogn} 



q J (A-l)! 

Ti + 1)! A 1 /logn + (A — 1) loglogn 



n 2 + l (A-l)! (logn)^ 1 V q 

(n + 1)! A 



(l + o(l)) 



n 2 + l (A-l)! 
so that (1) yields 



;i+o(i)), 



(n + IV 
E(Z y ) < \ ': ( logn + (A - 1) loglogn + (1 + o(l))- 



n 2 + l V "(A-l)! 

Finally, we choose a sample outcome for which |^4| < E(Zy) to complete the 
proof. □ 

We now describe the hypergraph formulation of [10] that was used in 
[12] to prove the Erdos-Hanani conjecture using a method that involved 
branching processes, dynamical algorithms, hypergraph theory, and differ- 
ential equations. In this formulation the vertices of the hypergraph consisted 
of the ensemble of t-sets; for us they would be the class of permutations in 
S n . The edges in [T2] were the collections of t-subsets of the /c-sets, so that 
the hypergraph was (.) uniform. If analogously, we let edges be the set of 
n-permutations covered by an (n + l)-permutation, then the hypergraph is 
no longer uniform. It is not too hard to prove, however, that each (n + 1)- 
permutation it covers n + 1 — s n n-permutations, where s^ is the number of 
successions in n, where a succession is defined as an episode ir(i+l) = 7r(z)±l. 
Moreover, we know [B] that the number of successions in a random permu- 
tation is approximately Poisson with parameter ~ 2, so that it is reasonable 
to assert that most hypergraph edges consist of n — 0(1) vertices. This is 
the first deviation from the Pippenger model, which we consider to be not 
too serious insofar as the lack of uniformity of the hypergraph is concerned 
but rather serious due to the fact that the uniformity level n — 0(1) is not 
finite. Lemma 1 above shows that the degree of each vertex is 0(n 2 ), and we 
will prove in Lemma 5 below that the codegree of two vertices it and it' is at 
most 0(1), so that the codegree is an order of magnitude smaller than the 
degree. This is good. The above problems with the hypergraph formulation 
notwithstanding, we make the following conjecture: 



Conjecture 4. For some constant A, 



and possibly A < 2. 

3 Thresholds 

If each permutation in S n+ \ is randomly and independently picked with prob- 
ability p, we will show that the probabilistic zero-one threshold for coverage 
of S n is at the level p = log n/n, i.e., at n times the level given by the upper 
bound in Theorem 2. Towards this end we prove the following result, of 
interest in its own right. 

Lemma 5. For any tt G S n , the set 

Jn := {tt' £ S n : Ti and tt' can be jointly covered by p G SVi.+i} 
has cardinality at most n 3 . Moreover, for any it, tt' G S n , the cardinality of 

Ctt.tt' := {p G S n+ i : p covers both n and tt'} 
is at most 4. 

Proof. Fix it. For an (n + l)-permutation to be able to successfully cover 
another it' G S n (in addition to it), it must contain an (n — l)-subpattern of 
it' . This subpattern may be present in ( "J = n possible positions of it, and 
can be represented, using the numbers {1,2, ... ,n}, in n ways. Finally, the 
nth letter of it' can be inserted into this subpattern in n ways. This proves 
the first part of the lemma; we observe that the n 3 bound is probably not 
best and try much harder to get the correct bound in the second part. 

To prove the second part, we note that p G S n+ \ is able to success- 
fully cover two co-coverable permutations it and it' only if there exists an 
insertion of the symbol into the sequences it = {tt\, tt 2 , ■ ■ ■ , vr n ) and n' = 
(7t[, tt' 2 , . . . , ir' n ), so that when the augmented sequences ir% = o = (a±, oi,..., a n +i) 
and 7T0 = a' = (o"i, cr' 2 , ■ ■ ■ , o~' n+l ) are laid atop each other, the following con- 
ditions hold: 
(i) There is a minimum index h such that 

Vin+i = < +1 = n > °i n = < = n - 1, . . . , a ih = a' ih =h-l, 



(we set h = n + 2 if no such index exists); 

fii) Either £ = or else there is a maximum index £ such that 



(iii) Without loss of generality (this is the "cr < a'" solution), 

°i h -i = h-2; a' ihi = 0; 
(iv) and, lastly, 

a i .=j-l-y i .=j-e + 2<j<h-2. 

The basic reason for this is that in the covering (n + 1) -permutation p, the 
numbers 1 and n+1 may only play the role of 1 and n respectively in either 
of the permutations it, it', while any other number t in p may play the role of 
either t or t - 1. Let us give an example. If it = 126983745; it' = 127938645, 
then a legal insertion of the symbols, with h = 10 and I — 5, is as follows: 



a 

a' 

P 



1269837045 
1279038645 
127 10 93864 5, 



whereas a "bad" insertion would be, e.g., 



a: 1269083745 

a' : 127938064 5, 



since, starting with 10 representing the two 9's and 9 representing the two 
8's - we find that there is no position, in p, where the 8 can be placed. 

Notice also that if the arrangements a and a' are legal, then they may 
be laid atop each other, by simultaneous rearrangement, so that each is left- 
right non-decreasing; and the covering (n + 1) -permutation is id n+ \. If the 
arrangements are not legal, then such a rearrangement leads to sequences 
that are not both non-decreasing. For instance, the legal example above can 
be rearranged as follows: 

9 



a: 1234506789 

a : 1234567809 



We shall prove that for any two co-coverable permutations tt, tt', there 
are at most two legal insertions of the symbols, for each of which there can 
be at most two covering permutations. First, align any two co-coverable n 
and ■k' in n — 1 spots, and introduce two symbols so that (i) through (iv) 
above are satisfied. It is clear that this can be done, because there exists a 
p which covers both n and ir', and deletion of two symbols from p yields a 
(n — l)-subpattern shared by ir, n'. The question is when this process is non- 
unique. It will be useful to introduce the notion of longest common match, 
or LCM, between two permutations. Given two co-coverable permutations 
it is easy to determine which one is the "larger" or "dominant" one. Call 
any dominant permutation ir' and the other ir (ir' may not be unique, as 
in Example (ii) below.) When it is laid atop ir', we identify an LCM, as a 
sequence of (n — 1) index pairs (mi,pi), or, alternatively, the values (vr mi , 7r'); 
1 < irii < m 2 < ■ ■ ■ < Tn n -i < n, 1 < pi < p 2 < . . . < p n -i < n, for which 
7r' — 7r m . G {0, 1}. There exist (n — 1) necessarily non-intersecting straight 
lines that connect the (7r m .,7r' ) pairs. These lines are either vertical or 
oblique, and the shape of the entire LCM looks either like 

or like 

The mismatched positions in n, ir', i.e., those that do not belong to an (mj, Pi) 
pair, may be (i) such that they can be connected without intersecting any 
other line (this occurs when there are no oblique lines); or (ii) immediately 
to different sides of a matched pair (there is one oblique line), or (iii) farther 
apart (at least two oblique lines), as seen in the examples below. In the first 
example, 

tt: 3874296 10 51 
tt' : 3 4 8 5 2 9 7 10 6 1, 
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1 


2 


6 


9 


8 


3 


7 


4 


5 


1 


2 


7 


9 


3 


8 


6 


4 


5 



Example (i) 

the mismatch could be considered to be between the 8 (in tt) and the 4 (in 
it'); or we have 

tt: 219347865 

tt' : 21937486 5, 

Example (ii) 

where the mismatch could be viewed as being between the two 7s or between 
the two 4s; and finally in the last example 

TT 

it' : 

Example (Hi) 
the 8 in tt and the 6 in tt' are mismatched. Example (iii) contains the same 
permutations as the ones introduced earlier; utilizing the notion of LCMs 
and mismatches, we have the (TT mi ,iT' p .) pairs consisting of 

(1,1), (2, 2), (6, 7), (9, 9), (3, 3), (7, 8), (4, 4), (5, 5). 

The skipped symbols, 8 in tt and 6 in tt', provide the mismatch - and the 
corresponding covering (n + l)-permutation can be constructed from the 
above matching by introducing the symbols to appear below or above the 
mismatched numbers exactly as before. Thus, the two ways of represent- 
ing co-coverable permutations - using either the symbols or LCMs, are 
equivalent, and we will use one or the other as appropriate. 

Given a mismatch in which the two numbers are the same, say a, they 
may be represented by either a or a + 1 in the covering (n + l)-permutation. 
This gives two coverings depending on where a is placed. But the common 
mismatched number may be chosen in up to two ways, as in Example (ii), 
for a total of 4 covering permutations; this maximal possibility exists if and 
only if tt' is obtained from tt by a single swap of adjacent elements. If the 
two mismatched numbers a, b are different, the case where these are "atop" 
each other (as in Example (i)) yields two covering permutations, since the 
symbols may be inserted in the two alternate forms 



or 
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which lead to different covering ps. If the mismatched numbers are farther 
apart and different, then it is clear that there is only one covering permuta- 
tion. 

In what follows, we will denote the positions of potential mismatches by 

(a, (3), (7, 5), and set 

a = 7i a , b = 7io] c = 7ij, d = 1X5. 

We now need to show that all possibilities have been covered, in other words, 
that there do not exist two LCMs with mismatched pairs at positions (a, (3) 
and at (7, 5), where c, d are not obtained from a, b by the processes described 
in the previous paragraph, and where c, d yield genuinely different covering 
permutations. First observe that alternative choices of c and d do exist. For 
instance, in Example (i), c and d may be taken to be 7 and 4; or 7 and 3; or 
8 and 3, but these do not yield ps that are different from the 4 alignment. 
We proceed by contradiction, and by considering all possible ways in 
which the mismatched pairs (a, b) and (c, d) can interact. First note that a, b 
must satisfy a > b in order the counteract the fact that all it' > 7T m . for all 
i. Moreover, if a = b then the LCM matches identical elements. 

CASE 1: Mismatches a and b are atop each other; mismatches c and d are 
also atop each other; c and d are to the right of a and b respectively. 

Since a and b are atop each other, we must have vr~ 1 (z) = 7r'~ 1 (i) for 
each i < b — l;z > a + 1; tt^ 1 ^) 7^ ir'^ 1 {i) for each b < i < a (assuming 
without loss of generality that a ^ b). Similarly, i and i are matched for each 
i < d — 1; z > c + 1 and not for other is. This forces [b, a] D [d, c] = 0, and 
thus 7r = 7r', a contradiction. 

CASE 2: a and b are atop each other; c and d are to the right of a and b 
respectively. 

Assume that d is to the right of c. c and d are initially matched to elements 
below and above it respectively. When the mismatched pair becomes {c,d), 
all matches between c and d are tilted to the right, e.g., the number below 
the c is now matched to the number to the right of c. Since c > d, and 
since % G vr' may only be matched with either i or i — 1 in n, we see that the 
sequences of integers between 7 and 5 are identical in ir and it', and consist 
of a monotone decreasing block of consecutive integers. It follows that all 
integers lower than d in it are matched to themselves in it', as are all integers 
higher than c. This forces both it and it' to be the same permutation of the 
identity, a contradiction. 

12 



CASE 3: a, (3, 7, 5 are in increasing order from left to right. 

We proceed as in Case 2, except that there are now two monotone blocks 
- between a and b; and between c and d. 

CASE 4: The positions appear in the order a, 7, (3, 5 or one of its variants. 

Assume that the (a, b) LCM has oblique lines that go from bottom left to 
top right. We see that the oblique lines corresponding to the (c, d)-mismatch 
LCM start between a and (3 but continue beyond (3. If, however, 7 > a + 1, 
or 5 > (3 + 1, we see that there are again one or two monotone blocks, forcing 
a contradiction. Thus a = 7 and (3 = 5, which reduces us to one mismatch. 
We thus need to consider when a = (3 = 7 or a = [3 = 5, as in Example (i), 
with the (7,4) or (8,3) mismatches. In this case, either a has to be compatible 
with both f3, 5; or (3 has to be compatible with both a, 7. The same covering 
permutations thus result with both sets of mismatches. 

CASE 5: The positions appear in the order 7, a, (3, 5 or one of its variants. 

Here a contradiction arises ii / ~f<a — 2oi5>(3 + 2. Assume, therefore, 
that 7 = a — 1; 5 = (3 + 1; a and (3 could be arbitrarily far apart, but the 
point is that the a — b mismatch could be rewired to become a, c — b or a — d or 
even a. c — d mismatch, as illustrated in Example (i), with the 7-3 mismatch. 
The fact that a and (3 could be arbitrarily far apart is illustrated by the 
example below: 

tv : 569 10 11 12 478123 

tt' : 6 9 10 11 12 3 5 7 8 1 2 4, 

Example (iv) 
where the (6,3) mismatch can be rewired to become a (5,3) mismatch without 
changing the covering permutation. If 7 = a — 1; 8 = /3 + 1; ac-d rewiring 
may lead to up to two entirely new ps, as in Example (ii). 

This completes the proof of Lemma 5. □ 

We are now ready to prove the main result of this section. 

Theorem 6. Consider the probability model in which each n G S' n+ i is in- 
dependently picked with probability p. Let the resulting random collection of 
permutations be denoted by A. Then, 

loen 
p < -2-(l + o*(l)) => P(A is a cover of S n ) -> (n -> 00), 
n 

13 



and 

los n 
p > -2— (1 + o(l)) => PU is a cover of 5 n ) -> 1 (n ->■ oo). 
n 

Proof. We use the Janson inequalities, see pp. By the lower Janson inequal- 
ity, we have, with X denoting as before the number of uncovered n G S n , 

¥(X = 0) > T P(permutation i is covered) 

i=i 

> ni-(i-pr a 

i=l 

n! 

> Y[exp{-(l-p) n2 /(l-(l- P r 2 )} 



exp{-n!(l - p) n /(l - (1 - p) n ) ->• 1 



if E(X) -} 0, or, using Stirling's formula, if Cy/n (j) n e n2p -)■ 0, i.e., if 
logn 1 logn w(n) logn 

p = + -7TT" + ~^r = x + °( 1 ))' 

n n 2n z n A n 

where u(n) — > oo is arbitrary. This proves the second part of the result. To 
prove the first part, we invoke the upper Janson inequality to give 

P(X = 0) < exp{-A + A} (2) 

where A = E(X) and 

A = 2_. / v F(i and j are both uncovered), 

i j~i 

and z ~ j if permutations i and j can be covered by the same (n + 1)- 
permutation. Since, by Lemma 5, for any i there are at most n 3 permutations 
j that can be jointly covered with i, and, if this is the case, there are at most 
4 covering (n + l)-permutations, it follows that 

A = n\n 3 (l-p) 2n2+2 -\ 
14 



so that (2) yields 

F(X = 0) < exp{-n!(l - p) n2+1 + n\n 3 {l - p) 2n ^ 2 } ->• 
if n!(l — p) n — >■ oo, i.e., if 

]ogn-l + I -l— U = lggn 

n n 

where w(n) — > oo is arbitrary. This completes the proof of Theorem 6. D 



4 Poisson Approximation in the "Gap" 

The proof of Theorem 6 reveals that K(X) undergoes a rapid transition when 
p is around the level given by (3). In fact, if 

logn-l + i^-^ 

p = — 2_a ^, K G E, 

n 

then for large n, E(X) ~ V^e~ K and P(X = 0) ~ expj-v^e"^}. Much 
more is true, however, as we shall show next: The entire probability distri- 
bution C(X) of X can be approximated, in the total variation sense, by that 
of a Poisson random variable with mean A = K(X) in a range of ps that 
allows for large means. This result can be thought of as being a probabilistic 
counterpart to Theorem 6, and is proved using the Stein-Chen method of 
Poisson approximation [2|: 

Theorem 7. Consider the model in which each n G S n+ \ is independently 
chosen with probability p, thus creating a random ensemble A of {n + 1)- 
permutations. Then d Ty (£(X),Po(\)) -> if p > ^(1 + e), where Po(A) 
denotes the Poisson distribution with parameter X, e > is arbitrary, and 
the total variation distance d^y is defined by 

d TY (C(Y),C(Z)) = sup \¥(Y G A)-¥(Z G A)\. 

ACZ+ 

Proof. As before, we set X = Y^j=i hi w ^h Ij = 1 if the j'th n-permutation 
is uncovered by permutations in A (Ij — otherwise), and let A = E(X) = 
n!(l — p) n +1 . Consider the following coupling, for each j: If Ij = 1, we 
"do nothing," setting J t = Jji — I i: 1 < i < n\. If, on the other hand, 
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the jth permutation is covered by one or more (n + 1) -permutations in A, 
we "deselect" these permutations, setting J; = 1 if the ith permutation is 
uncovered after this change is made; Jj = otherwise. Now it is clear that 
Ji > Ii for i 7^ j, since there is no way that an uncovered permutation can 
magically get covered after a few (n + l)-permutations are deselected. Also, 
setting N = n\, we have for each j, 

£(Ji, J2, ■ ■ ■ , Jn) = £(A, • • • , In\Ij = !)• 
Corollary 2.C.4 in j2] thus applies, telling us that 

rf TV (£(X),Po(A)) < l -^ (\{X) - A + 2^P 2 (J, = 1) j , (4) 

Bounding 1 — e~ A by one, (4) yields 

dTv(£(X), Po(A)) < ^p- - 1 + 2(1 - pf +1 . (5) 

The last term in (5), namely (1 — p) n +1 can easily be verified to tend to zero 
as long as p ^> 1/n 2 , so we turn to a computation of V(X): 

v(x) = j2 {m) - e 2 ^)} + E^^) - E (W)}> 

so that 



i~7 



V(X) Ei~ 3 -{E(^j)-E(^)E(^)} 

A A 

< n 3 ((l-p) n2 - 3 -(l-p) n2+1 

= n\l-p) n2+l {{l-p)~ A -l] 

< 5pn 3 exp{— n 2 p] — > 

provided that p > A 1 ^, with A > 1. D 



5 Open Problems 

Proving the best possible result along the lines of Conjecture 4 would clearly 
be a most desirable outcome of this work. Can, e.g. we write "lim" instead of 
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"limsup"? Is A = 1? Secondly, can the first bound in Lemma 5 be improved? 
(Note, however, that such an improvement would provide only marginal im- 
provements in Theorems 6 and 7.) Finally, extending the results of this 
paper to encompass coverings of n permutations by (n + k) -permutations 
would lead to several interesting questions and new techniques; for example, 
Theorem 2 can readily be generalized for k > 2. Also, the maximum degree 
of the dependency graph induced by the indicator random variables {IA- is 
low even for k > 2, so that these variables are almost independent, and one 
may even envision tight results as k — > oo. 

6 Acknowledgment 

The research of all four authors was supported by NSF Grant 1004624. 

References 

[1] N. Alon and J. Spencer, 1992. The Probabilistic Method, John Wiley, 
New York. 

[2] A. Barbour, L. Hoist, and S. Janson, 1992. Poisson Approximation, 
Oxford University Press. 

[3] P. Erdos and J. Spencer, 1974. Probabilistic Methods in Combinatorics, 
Academic Press, New York 

[4] P. Flajolet and R. Sedgewick, 2009. Analytic Combinatorics, Cambridge 
University Press. 

[5] A. Godbole and S. Janson (1996). "Random covering designs," J. Com- 
binatorial Theory, Series A 75, 85-98. 

[6] A. Godbole and P. Sissokho (1999). "Near-matches and successions in 
random permutations," Congressus Numerantium 135, 159-170. 

[7] A. Godbole, D. Skipper, and R. Sunley (1996). "t-covering arrays: upper 
bounds and Poisson approximations," Combinatorics, Probability and 
Computing 5, 105-118. 



17 



[8] A. Godbole, S. Thompson, and E. Vigoda (1996). "General upper 
bounds for covering numbers," Ars Combinatoria 42, 211-221. 

[9] A. Myers and H. Wilf (2003). "Some new aspects of the coupon collec- 
tor's problem," SIAM J. Discrete Mathematics 17, 1-17. 

[10] N. Pippenger and J. Spencer (1989). "Asymptotic behaviour of the chro- 
matic index for hypergraphs," J. Combinatorial Theory, Series A 51, 
24-42. 

[11] V. Rodl (1985). "On a packing and covering problem," European J. 
Combinatorics 5, 69-78. 

[12] J. Spencer (1995). "Packing and covering via a branching process," 
Rand. Structures Algorithms 7, 167-172. 



